    Topic models for short text data

    Topic models are known to suffer from sparsity when applied to short text data. The problem is caused by the reduced number of observations available for reliable inference (i.e., the words in a document). A popular heuristic for overcoming this problem is to aggregate documents by some context (e.g., author or hashtag) before training. One part of this dissertation is dedicated to modeling explicitly the implicit assumptions of the document aggregation heuristic and applying it to two well-known model architectures: a mixture and an admixture. Our findings indicate that an admixture model benefits more from aggregation than a mixture model, which rarely improved over its baseline (the standard mixture). We also find that the state of the art on short text data can be surpassed as long as every context is shared by a small number of documents. In the second part of the dissertation we develop a more general-purpose topic model which can also be used when contextual information is not available. The proposed model is formulated around the observation that a classic topic model such as an admixture works well on normal text data because patterns of word co-occurrence arise across the documents, whereas such patterns are much less likely to arise in a short text dataset. The model assumes every document is a bag of word co-occurrences, where each co-occurrence belongs to a latent topic. The documents are enhanced a priori with related co-occurrences from the other documents, so that the collection has a greater chance of exhibiting word patterns. The proposed model performs well, surpassing the state of the art and popular topic model baselines.
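
    The aggregation heuristic studied in the first part can be illustrated with a minimal sketch: short documents sharing a context (a hypothetical hashtag field here) are pooled into pseudo-documents before fitting a standard admixture model. Scikit-learn's LDA stands in for the dissertation's mixture and admixture models; the data and the pooling key are illustrative assumptions, not taken from the thesis. Pooling by author instead of hashtag would only change the key used in the dictionary.

```python
from collections import defaultdict
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation

# Hypothetical short documents, each tagged with a context (here a hashtag).
tweets = [
    ("nlp", "topic models for short tweets"),
    ("nlp", "sparse word counts hurt topic inference"),
    ("soccer", "great match last night"),
    ("soccer", "a late goal decided the match"),
]

# Aggregation heuristic: pool all texts sharing a context into one pseudo-document.
pooled = defaultdict(list)
for context, text in tweets:
    pooled[context].append(text)
pseudo_docs = [" ".join(texts) for texts in pooled.values()]

# Train a standard admixture model (LDA) on the pooled pseudo-documents.
X = CountVectorizer().fit_transform(pseudo_docs)
lda = LatentDirichletAllocation(n_components=2, random_state=0).fit(X)
print(lda.components_.shape)  # (n_topics, vocabulary_size)
```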

    Scoring Coreference Chains with Split-Antecedent Anaphors

    Anaphoric reference is an aspect of language interpretation covering a variety of types of interpretation beyond the simple case of identity reference to entities introduced via nominal expressions, which is the case covered by the traditional coreference task in its most recent incarnation in ONTONOTES and similar datasets. One of the cases that go beyond simple coreference is anaphoric reference to entities that must be added to the discourse model via accommodation, in particular split-antecedent reference to entities constructed out of other entities, as in split-antecedent plurals and in some cases of discourse deixis. Although this type of anaphoric reference is now annotated in many datasets, systems interpreting such references cannot be evaluated using the Reference Coreference Scorer (Pradhan et al., 2014). As part of the work towards a new scorer for anaphoric reference able to evaluate all aspects of anaphoric interpretation covered by the Universal Anaphora initiative, we propose in this paper a solution to the technical problem of generalizing existing metrics for identity anaphora so that they can also be used to score cases of split antecedents. This is the first such proposal in the literature on anaphora or coreference, and it has been successfully used to score both split-antecedent plural references and discourse deixis in the recent CODI/CRAC anaphora resolution in dialogue shared tasks.
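
    The paper's actual generalization of the identity-anaphora metrics is not reproduced here. Purely as an illustration of the underlying idea, the sketch below gives a split-antecedent anaphor partial credit proportional to how many of its gold antecedent entities the system recovered; the function name, the equal weighting, and the example mentions are all hypothetical.

```python
def split_antecedent_credit(gold_antecedents, system_antecedents):
    """Partial credit for a split-antecedent anaphor.

    Each gold antecedent entity carries an equal share of the credit;
    the system earns the share for every gold antecedent it recovered.
    (Illustrative only; not the scorer proposed in the paper.)
    """
    gold = set(gold_antecedents)
    if not gold:
        return 0.0
    recovered = gold & set(system_antecedents)
    return len(recovered) / len(gold)

# "John met Mary. They left." -- gold split antecedents: {John, Mary}.
print(split_antecedent_credit({"John", "Mary"}, {"John"}))          # 0.5
print(split_antecedent_credit({"John", "Mary"}, {"John", "Mary"}))  # 1.0
```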

    The algorithm for going through a labyrinth by an autonomous vehicle

    The paper presents an algorithm for going through a path-type labyrinth with an autonomous vehicle. The detection of the path and the maintenance of the motion direction are addressed, as well as traversal of the labyrinth along road segments and the categories of crossings occurring in it. The algorithm has been implemented in C++ and validated on an experimental model, which fully confirmed its correctness.
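
    The paper's C++ algorithm for road segments and crossing categories is not reproduced here; as a generic illustration of path-maze traversal, the sketch below applies the common left-hand (wall-following) rule on a hypothetical grid, where 0 marks a free cell and 1 a wall.

```python
# Generic left-hand-rule maze walker on a grid; illustrative only, not the
# algorithm from the paper.  0 = free cell, 1 = wall.
MAZE = [
    [1, 1, 1, 1, 1],
    [1, 0, 0, 0, 1],
    [1, 0, 1, 0, 1],
    [1, 0, 1, 0, 1],
    [1, 1, 1, 0, 1],
]
START, GOAL = (1, 1), (4, 3)
# Headings in clockwise order: north, east, south, west (row, column deltas).
HEADINGS = [(-1, 0), (0, 1), (1, 0), (0, -1)]

def free(pos):
    r, c = pos
    return 0 <= r < len(MAZE) and 0 <= c < len(MAZE[0]) and MAZE[r][c] == 0

def step(pos, heading):
    dr, dc = HEADINGS[heading]
    return (pos[0] + dr, pos[1] + dc)

pos, heading = START, 2  # start facing south
path = [pos]
while pos != GOAL:
    # Left-hand rule: prefer turning left, then straight, then right, then back.
    for turn in (-1, 0, 1, 2):
        cand = (heading + turn) % 4
        if free(step(pos, cand)):
            heading = cand
            pos = step(pos, cand)
            break
    path.append(pos)
print(path)  # sequence of visited cells from START to GOAL
```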

    A Probabilistic Annotation Model for Crowdsourcing Coreference

    The availability of large-scale annotated corpora for coreference is essential to the development of the field. However, creating resources at the required scale via expert annotation would be too expensive. Crowdsourcing has been proposed as an alternative, but this approach has not been widely used for coreference. This paper addresses one crucial hurdle on the way to making this possible by introducing a new model of annotation for aggregating crowdsourced anaphoric annotations. The model is evaluated along three dimensions: the accuracy of the inferred mention pairs, the quality of the post-hoc constructed silver chains, and the viability of using the silver chains as an alternative to the expert-annotated chains in training a state-of-the-art coreference system. The results suggest that our model can extract from crowdsourced annotations coreference chains of comparable quality to those obtained with expert annotation.
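
    The paper's annotation model is not reproduced here. As a rough illustration of aggregating crowdsourced mention-pair judgments, the sketch below runs a binary Dawid-Skene-style EM that jointly infers the probability that each pair is coreferent and each annotator's sensitivity and specificity; the judgment matrix and the add-one smoothing are assumptions.

```python
import numpy as np

# Hypothetical crowdsourced judgments on mention pairs:
# votes[i, a] = 1 if annotator a marked pair i as coreferent, 0 otherwise.
votes = np.array([
    [1, 1, 0],
    [0, 0, 0],
    [1, 0, 1],
    [1, 1, 1],
    [0, 1, 0],
])
n_items, n_annot = votes.shape

# Initialise the per-pair probability of coreference with the vote average.
q = votes.mean(axis=1).astype(float)

for _ in range(50):
    # M-step: per-annotator sensitivity and specificity, with add-one smoothing.
    sens = (q @ votes + 1) / (q.sum() + 2)
    spec = ((1 - q) @ (1 - votes) + 1) / ((1 - q).sum() + 2)
    prior = q.mean()
    # E-step: posterior that each pair is coreferent given all votes.
    log_pos = np.log(prior) + (votes * np.log(sens) + (1 - votes) * np.log(1 - sens)).sum(axis=1)
    log_neg = np.log(1 - prior) + ((1 - votes) * np.log(spec) + votes * np.log(1 - spec)).sum(axis=1)
    q = 1 / (1 + np.exp(log_neg - log_pos))

print(np.round(q, 2))                        # inferred probability each pair is coreferent
print(np.round(sens, 2), np.round(spec, 2))  # inferred annotator reliabilities
```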

    Comparing Bayesian Models of Annotation

    The analysis of crowdsourced annotations in NLP is concerned with identifying 1) gold standard labels, 2) annotator accuracies and biases, and 3) item difficulties and error patterns. Traditionally, majority voting was used for 1), and coefficients of agreement for 2) and 3). Lately, model-based analyses of corpus annotations have proven better at all three tasks, but there has been relatively little work comparing them on the same datasets. This paper aims to fill this gap by analyzing six models of annotation, covering different approaches to annotator ability, item difficulty, and parameter pooling (tying) across annotators and items. We evaluate these models along four aspects: comparison to gold labels, predictive accuracy for new annotations, annotator characterization, and item difficulty, using four datasets with varying degrees of noise in the form of random (spammy) annotators. We conclude with guidelines for model selection, application, and implementation.
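
    The six Bayesian models compared in the paper are not reproduced here; the sketch below only illustrates two of the evaluation aspects on hypothetical data: comparison of majority-vote labels to gold, and a crude annotator characterization via agreement with the aggregated labels (a stand-in for model-based ability estimates).

```python
import numpy as np

# Hypothetical binary annotations: rows are items, columns are annotators.
anno = np.array([
    [0, 0, 1],
    [1, 1, 1],
    [0, 1, 0],
    [1, 0, 1],
])
gold = np.array([0, 1, 0, 1])  # hypothetical gold labels, used only for evaluation

# Traditional baseline for task 1): majority voting yields one label per item.
majority = (anno.mean(axis=1) > 0.5).astype(int)

# Aspect "comparison to gold labels".
print("majority-vote accuracy:", (majority == gold).mean())

# Aspect "annotator characterization": each annotator's agreement with the
# aggregated labels, a crude proxy for the inferred annotator abilities.
print("annotator agreement:", (anno == majority[:, None]).mean(axis=0))
```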

    Beyond Black & White: Leveraging Annotator Disagreement via Soft-Label Multi-Task Learning

    Supervised learning assumes that a ground-truth label exists. However, the reliability of this ground truth depends on human annotators, who often disagree. Prior work has shown that this disagreement can be helpful in training models. We propose a novel method to incorporate this disagreement as information: in addition to the standard error computation, we use soft labels (i.e., probability distributions over the annotator labels) as an auxiliary task in a multi-task neural network. We measure the divergence between the predictions and the target soft labels with several loss functions and evaluate the models on various NLP tasks. We find that the soft-label prediction auxiliary task reduces the penalty for errors on ambiguous entities and thereby mitigates overfitting. It significantly improves performance across tasks beyond the standard approach and prior work.
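
    A minimal sketch of this setup, assuming a toy feed-forward encoder in place of the paper's task-specific networks: one head is trained with the standard cross-entropy against the hard label, while an auxiliary head predicts the soft-label distribution over annotator judgments, here with a KL-divergence loss (the paper compares several divergence losses). The dimensions, the random data, and the unweighted sum of the two losses are assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SoftLabelMultiTask(nn.Module):
    """Shared encoder with a hard-label head and an auxiliary soft-label head."""
    def __init__(self, in_dim, hidden, n_classes):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(in_dim, hidden), nn.ReLU())
        self.hard_head = nn.Linear(hidden, n_classes)  # main task
        self.soft_head = nn.Linear(hidden, n_classes)  # auxiliary task

    def forward(self, x):
        h = self.encoder(x)
        return self.hard_head(h), self.soft_head(h)

# Hypothetical batch: features, hard labels, and annotator label distributions.
x = torch.randn(8, 32)
hard_y = torch.randint(0, 3, (8,))
soft_y = torch.softmax(torch.randn(8, 3), dim=-1)  # stands in for annotator votes

model = SoftLabelMultiTask(32, 64, 3)
hard_logits, soft_logits = model(x)

# Standard error computation on the main task ...
loss_hard = F.cross_entropy(hard_logits, hard_y)
# ... plus a divergence between predictions and target soft labels.
loss_soft = F.kl_div(F.log_softmax(soft_logits, dim=-1), soft_y, reduction="batchmean")
loss = loss_hard + loss_soft
loss.backward()
```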

    Learning from disagreement: a survey

    Many tasks in Natural Language Processing (NLP) and Computer Vision (CV) offer evidence that humans disagree, from objective tasks such as part-of-speech tagging to more subjective tasks such as classifying an image or deciding whether a proposition follows from certain premises. While most learning in artificial intelligence (AI) still relies on the assumption that a single (gold) interpretation exists for each item, a growing body of research aims to develop learning methods that do not rely on this assumption. In this survey, we review the evidence for disagreements on NLP and CV tasks, focusing on tasks for which substantial datasets containing this information have been created. We discuss the most popular approaches to training models from datasets containing multiple judgments potentially in disagreement. We systematically compare these different approaches by training them with each of the available datasets, considering several ways to evaluate the resulting models. Finally, we discuss the results in depth, focusing on four key research questions, and assess how the type of evaluation and the characteristics of a dataset determine the answers to these questions. Our results suggest, first of all, that even if we abandon the assumption of a gold standard, it is still essential to reach a consensus on how to evaluate models, because the relative performance of the various training methods is critically affected by the chosen form of evaluation. Secondly, we observed a strong dataset effect. With substantial datasets, providing many judgments by high-quality coders for each item, training directly with soft labels achieved better results than training from aggregated or even gold labels; this result holds for both hard and soft evaluation. But when the above conditions do not hold, leveraging both gold and soft labels generally achieved the best results in the hard evaluation. All datasets and models employed in this paper are freely available as supplementary materials.
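
    A minimal sketch of two ingredients the survey's comparison rests on, using hypothetical data: soft labels obtained by normalizing the judgment counts per item, and the two kinds of evaluation, hard (accuracy against gold or aggregated labels) and soft (a divergence between the model's and the annotators' distributions, cross-entropy here).

```python
import numpy as np

# Hypothetical crowd judgments per item over 3 classes (vote counts).
counts = np.array([
    [8, 1, 1],
    [3, 4, 3],
    [0, 2, 8],
])
gold = np.array([0, 1, 2])  # hypothetical gold labels

# Soft labels: normalised judgment distributions; aggregated labels: their argmax.
soft = counts / counts.sum(axis=1, keepdims=True)
hard = soft.argmax(axis=1)

# Hypothetical model predictions (probability distributions over classes).
pred = np.array([
    [0.7, 0.2, 0.1],
    [0.3, 0.5, 0.2],
    [0.1, 0.2, 0.7],
])

# Hard evaluation: accuracy against gold and against aggregated labels.
print("accuracy vs gold:      ", (pred.argmax(axis=1) == gold).mean())
print("accuracy vs aggregated:", (pred.argmax(axis=1) == hard).mean())

# Soft evaluation: cross-entropy between human and model distributions.
print("cross-entropy:", -(soft * np.log(pred)).sum(axis=1).mean())
```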

    Crowdsourcing and Aggregating Nested Markable Annotations

    One of the key steps in language resource creation is the identification of the text segments to be annotated, or markables, which depending on the task may vary from nominal chunks for named entity resolution, to (potentially nested) noun phrases in coreference resolution (or mentions), to larger text segments in text segmentation. Markable identification is typically carried out semi-automatically, by running a markable identifier and correcting its output by hand, which is increasingly done via annotators recruited through crowdsourcing and aggregation of their responses. In this paper, we present a method for identifying markables for coreference annotation that combines high-performance automatic markable detectors with checking via a Game-With-A-Purpose (GWAP) and aggregation using a Bayesian annotation model. The method was evaluated both on news data and on data from a variety of other genres, and results in an improvement in F1 on mention boundaries of over seven percentage points compared with a state-of-the-art, domain-independent automatic mention detector, and of almost three points over an in-domain mention detector. One of the key contributions of our proposal is its applicability to the case in which markables are nested, as is the case with coreference markables; but the GWAP and several of the proposed markable detectors are task- and language-independent and are thus applicable to a variety of other annotation scenarios.
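
    The Bayesian aggregation model and the GWAP themselves are not sketched here; the fragment below only illustrates the overall pipeline shape on hypothetical data: candidate markables proposed by detectors, yes/no player judgments per candidate, and an acceptance rule (a simple vote proportion stands in for the Bayesian model) that keeps nested spans rather than forcing non-overlapping output.

```python
# Hypothetical candidate markables as (start, end) token offsets, end exclusive;
# nested spans (e.g. "Boston" inside "the mayor of Boston") are allowed.
candidates = [(0, 4), (3, 4), (6, 7)]

# Hypothetical yes/no judgments collected from players, per candidate span.
judgments = {
    (0, 4): [1, 1, 1, 0],
    (3, 4): [1, 1, 0, 1],
    (6, 7): [0, 0, 1, 0],
}

# Aggregation stand-in: keep a span if enough players accepted it.
def accept(span, threshold=0.6):
    votes = judgments[span]
    return sum(votes) / len(votes) >= threshold

markables = [s for s in candidates if accept(s)]
print(markables)  # nested spans can both survive: [(0, 4), (3, 4)]
```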